Facebook Page Spam detection using Support Vector Machines based on n-gram model

نویسنده

  • Himani Chawla
چکیده

With social networks like Facebook, twitter reaching to the common masses, these have become the best target for spammers. The newest way to mislead and fraud viewers is Page Spam . Viewers are deceived to click on links to spam their connections, redirect to a fraudulent business or spread wrong information about famous figures, organizations and causes. This research aims to categorize such pages from authentic fan pages using support vector machines [2] and n gram models. Further an attempt has been made to improve our findings by some optimizations.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Spam Detection Using Character N-Grams

This paper presents a content-based approach to spam detection based on low-level information. Instead of the traditional 'bag of words' representation, we use a 'bag of character n-grams' representation which avoids the sparse data problem that arises in n-grams on the word-level. Moreover, it is language-independent and does not require any lemmatizer or 'deep' text preprocessing. Based on ex...

متن کامل

A New Model for Email Spam Detection using Hybrid of Magnetic Optimization Algorithm with Harmony Search Algorithm

Unfortunately, among internet services, users are faced with several unwanted messages that are not even related to their interests and scope, and they contain advertising or even malicious content. Spam email contains a huge collection of infected and malicious advertising emails that harms data destroying and stealing personal information for malicious purposes. In most cases, spam emails con...

متن کامل

Cybercrime detection techniques based on support vector machines

This paper presents the cybercrime detection model by using support vector machines (SVMs) to classify social network (Facebook) dataset. We try to compare between three kinds of classification algorithms such as: SVMs, AdaBoostM1, and NaiveBayes in order to find a high percentage of classification accuracy. Finally, we conclude SVMs as the best classification algorithm, which uses different br...

متن کامل

An Approach for Spam E-mail Detection with Support Vector Machine and n-Gram Indexing

Many solutions have been deployed to prevent harmful effects from spam mail. Typical methods are either pattern matching using the keyword or method using the probability such as naive Bayesian method. In this paper, we proposed a classification method of spam mail from normal mail using support vector machine, which has excellent performance in binary pattern classification problems. Especiall...

متن کامل

Spam Filtering Based on Supervised Latent Semantic Features Extraction

Spam text is an universal phenomenon on the “open web”, including large-scale email systems and the growing number of Blogs. Handling this information overload is becoming an increasingly challenging problem, A promising approach is the using of content-based filtering. In this paper, our focus is placed on finding effective dimension reduction method for email Spam filtering, we apply a superv...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2014